CDS

Accession Number TCMCG018C23052
gbkey CDS
Protein Id XP_031738304.1
Location join(35903910..35903989,35909783..35909850,35910102..35910173,35910357..35910706,35910941..35911009,35911484..35911572,35911671..35913821,35913909..35913986,35914143..35914254,35914824..35914874,35914968..35915018)
Gene LOC101213543
GeneID 101213543
Organism Cucumis sativus

Protein

Length 1056aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA182750
db_source XM_031882444.1
Definition dentin sialophosphoprotein isoform X3 [Cucumis sativus]

EGGNOG-MAPPER Annotation

COG_category S
Description Occludin homology domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11807        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAGATCAAGCGGGTTGAGGCGCAAGGCGGGACTCCGAGGATTAAGTTTGATGCCAACGCCAATAATTCTAGTGGTAATAGCTATCGTTTTAGAACCTGGGAATCCATCAATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCGTGGAGGCATTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGTTGGGCCTCCAAAATCTACATATAAACCTGGTATGCCATCGTTACCTGCTTCTAAGGATAGGCTATCATCTTCACCTATTCCGTTGCCACCTGAGCAATTCGGTGCTCCAGTATCTCAATTTGGGTCTGCAAACACCAGTAAGACCCATGTTATTGCAGAAGATATTAGACCTCGAGTACCTGCTAAGATTAATCCTGCTGCTAGCAACGAGAAGGAAATCCCGACCATAGCTCCAAAAGGAGTACTTGAAACACCAGGACAGGAAGGGAATAGTGGAACTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTTGGAGAATCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCGGTGAAAAAGATTGAGCCAATCATTAAAAAAATTGCAACCTACCAAGCTCCAGGGAGATATCTTTTGAAATCAGGAGTTGGGTTGGAAGGCTCTAAAAAACCTACATCGGAAGGTGAAAGCTCTCCTTTGATCAGCCATCACCAAACCTCTGTACATGAAGACCTCCCTGATCAAACAAATGCTCCAGAATTGCAGTTAGAAGCAAGATGTGGCATGGATTTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGACAAATGGCATCCAACAGCCTGATCCTTTTGCTGAGAAAAAAAGTTCCGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGATAGTGATAGTAGTGATAGTGGAAGCGATAGTGGGAATCATAGTAGGAGTAGAAGCCGAAGCCCTGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGGGCCTTCCAATAGCCAGGAGGGTTCTGATGTGGACGTGGATATCATGACCAGTGACGATGACAAAGAATCCAAGCAGAAATTGCAAGCTTCTGTGCAGGGTTTCTCTACATCCCCTGCTGCTTGGAAAAGTCCAGATGGTGGGCCTGTGCAGATTATAGATGATGAGAAGGAAGACGGACAAGAATATGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGGTCGTAGTTTACTTCCTACTGAAGAAGGTGTAAGACCCGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATTGGGAGTTTGTTTGAGGATAGGGAAAATAATGTTGTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGAATATCTAAAGGCAAATCTAAAAGAAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCTGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCGCAATTTATCTCCTAGTAAACTCAATAGAGATTCTGTCAGAAATCTTACAAGTCAAGTTACTAATAAAGGGGAAATTAAAGGCAATTCTGACTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTTCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAAAGTGGAGTGAGGGCTGTAGATACAGCTACCAGAGCCGACAAGCATGGTGATATTGGGCGTGGCACTAAACACACTGAAAAGAGTGGCCATGCTAATGAAAATTTTCATGTGTTCAAAGATACATTTTATGGAAACCCTGATAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGGCCAGGGGACAAACAGATACAACCTTTGGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGTAAATTCAAAGATGGCCAAACATTTTCAAGCTCACAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCAGTTAATGGAAAAGGTCGAATTCTACAAAGAGAGCCTTCAGACCTGGAGTTAGGTGAACTTCGGGAGCCTTTCCACGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGGGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTTTAGAATATGGAAAGCGGTCGTCACCCCATGTAAGTACTAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAAAATTCAGAACATATAGTTGAAGATTCTAACAGGATTAACAACCGGTCTTTGCTCTCTCATTCACAATATAATTCAAGAATAGATCATGCTGAAGTCGACAAGTCAGCTGATGGAAATGTAAAACCTAATCAAGGGAATGGTCCAGAAGGCTATGTGGAAAGCAACAGAAAAGCTTCTGTTGGCATTTCCCAGCTGAATGATACAAAAAGAGAACAGCCTCCCTCAAAAAAAGGAAGTAAAAGACAAGCACCTAATCCAATAACCGAAGTTACTGATGGACTTAAAAACCCAGTATCAGCTGAGCGTGAAAATAGCGATCCAAAGAGACGAGATTCCTCTTCAGACGAAAATAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTGAAGGGAGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAGGAGTATCATGATAAATATGAATCATACCTATCTTTGAACAAAATCCTAGAAAGCTACAGGACTGAGTTCTGCAAACTCGGGAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAAATACTTTAATGTCTTAGGACAGTTGAAAGAATCCTATCGGCTGTGTTCAACGAGGCACAAGAGATTGAAAAAAATATTCATTGTTCTCCACGAAGAGCTGAAGCATATAAAGGAAAGGATTAGAGATTTTGTACAAACATATGCAAAAGATTAA
Protein:  
MRSSGLRRKAGLRGLSLMPTPIILVVIAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDIRPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKSGVGLEGSKKPTSEGESSPLISHHQTSVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQPDPFAEKKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESDGPSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDGQEYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQAGWRPHDQSGVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGTKEKKVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSANRSPVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSDLNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQYNSRIDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQAPNPITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCSTRHKRLKKIFIVLHEELKHIKERIRDFVQTYAKD